智能论文笔记

Learning Binary and Sparse Permutation-Invariant Representations for Fast and Memory Efficient Whole Slide Image Search

Sobhan Hemati , Shivam Kalra , Morteza Babaie , H. R. Tizhoosh

分类：计算机视觉

2022-08-29

学习合适的全幻灯片图像（WSIS）表示有效检索系统是一项非平凡的任务。从当前方法中获得的WSI嵌入在欧几里得空间中并不理想有效的WSI检索。此外，由于同时处理多组贴片，因此大多数当前方法都需要高GPU存储器。为了应对这些挑战，我们提出了一个新颖的框架，用于利用深层生成建模和Fisher向量学习二进制和稀疏的WSI表示。我们引入了新的损失功能，以学习稀疏和二进制置换不变的WSI表示，采用基于实例的培训来提高记忆效率。在癌症基因组地图集（TCGA）和肝脏-Kidney-Stomach（LKS）数据集上验证了博学的WSI表示。在检索准确性和速度方面，该方法的表现优于Yottixel（最新的组织病理学图像搜索引擎）。此外，我们在公共基准LKS数据集中对SOTA实现了竞争性能，以进行WSI分类。

translated by 谷歌翻译

ProxyFL: Decentralized Federated Learning through Proxy Model Sharing

Shivam Kalra , Junfeng Wen , Jesse C. Cresswell , Maksims Volkovs , Hamid R. Tizhoosh

分类：机器学习

2021-11-22

在金融和医疗保健等高度监管域中的机构通常存在围绕数据共享的限制性规则。联合学习是一种分布式学习框架，可以实现对分散数据的多机构合作，并改善了每个合作师的数据隐私的保护。在本文中，我们提出了一种用于分散的联邦学习的通信有效的方案，称为ProxyFL或基于代理的联合学习。 ProxyFL中的每个参与者都维护了两个模型，私人模型和旨在保护参与者隐私的公开共享代理模型。代理模型允许参与者之间的高效信息交换，使用PushSum方法而无需集中式服务器。所提出的方法通过允许模型异质性消除了规范联合学习的显着限制;每个参与者都可以拥有任何架构的私有模型。此外，我们通过代理通信的协议导致使用差异隐私分析的隐私保障更强。对流行的图像数据集的实验，以及使用超过30,000多个高质量的千兆的千兆子痫组织的泛癌诊断问题整个幻灯片图像，表明ProxyFL可以优于现有的现有替代方案，越来越少的沟通开销和更强大的隐私。

translated by 谷歌翻译

Quantum-Inspired Tensor Neural Networks for Option Pricing

Raj G. Patel , Chia-Wei Hsing , Serkan Sahin , Samuel Palmer , Saeed S. Jahromi , Shivam Sharma , Tomas Dominguez , Kris Tziritas , Christophe Michel , Vincent Porte

分类：机器学习

2022-12-28

Recent advances in deep learning have enabled us to address the curse of dimensionality (COD) by solving problems in higher dimensions. A subset of such approaches of addressing the COD has led us to solving high-dimensional PDEs. This has resulted in opening doors to solving a variety of real-world problems ranging from mathematical finance to stochastic control for industrial applications. Although feasible, these deep learning methods are still constrained by training time and memory. Tackling these shortcomings, Tensor Neural Networks (TNN) demonstrate that they can provide significant parameter savings while attaining the same accuracy as compared to the classical Dense Neural Network (DNN). In addition, we also show how TNN can be trained faster than DNN for the same accuracy. Besides TNN, we also introduce Tensor Network Initializer (TNN Init), a weight initialization scheme that leads to faster convergence with smaller variance for an equivalent parameter count as compared to a DNN. We benchmark TNN and TNN Init by applying them to solve the parabolic PDE associated with the Heston model, which is widely used in financial pricing theory.

translated by 谷歌翻译

Generalised agent for solving higher board states of tic tac toe using Reinforcement Learning

Bhavuk Kalra

分类：人工智能

2022-12-23

Tic Tac Toe is amongst the most well-known games. It has already been shown that it is a biased game, giving more chances to win for the first player leaving only a draw or a loss as possibilities for the opponent, assuming both the players play optimally. Thus on average majority of the games played result in a draw. The majority of the latest research on how to solve a tic tac toe board state employs strategies such as Genetic Algorithms, Neural Networks, Co-Evolution, and Evolutionary Programming. But these approaches deal with a trivial board state of 3X3 and very little research has been done for a generalized algorithm to solve 4X4,5X5,6X6 and many higher states. Even though an algorithm exists which is Min-Max but it takes a lot of time in coming up with an ideal move due to its recursive nature of implementation. A Sample has been created on this link \url{https://bk-tic-tac-toe.herokuapp.com/} to prove this fact. This is the main problem that this study is aimed at solving i.e providing a generalized algorithm(Approximate method, Learning-Based) for higher board states of tic tac toe to make precise moves in a short period. Also, the code changes needed to accommodate higher board states will be nominal. The idea is to pose the tic tac toe game as a well-posed learning problem. The study and its results are promising, giving a high win to draw ratio with each epoch of training. This study could also be encouraging for other researchers to apply the same algorithm to other similar board games like Minesweeper, Chess, and GO for finding efficient strategies and comparing the results.

translated by 谷歌翻译

Design of an All-Purpose Terrace Farming Robot

Vibhakar Mohta , Adarsh Patnaik , Shivam Kumar Panda , Siva Vignesh Krishnan , Abhinav Gupta , Abhay Shukla , Gauri Wadhwa , Shrey Verma , Aditya Bandopadhyay

分类：机器人

2022-12-04

Automation in farming processes is a growing field of research in both academia and industries. A considerable amount of work has been put into this field to develop systems robust enough for farming. Terrace farming, in particular, provides a varying set of challenges, including robust stair climbing methods and stable navigation in unstructured terrains. We propose the design of a novel autonomous terrace farming robot, Aarohi, that can effectively climb steep terraces of considerable heights and execute several farming operations. The design optimisation strategy for the overall mechanical structure is elucidated. Further, the embedded and software architecture along with fail-safe strategies are presented for a working prototype. Algorithms for autonomous traversal over the terrace steps using the scissor lift mechanism and performing various farming operations have also been discussed. The adaptability of the design to specific operational requirements and modular farm tools allow Aarohi to be customised for a wide variety of use cases.

translated by 谷歌翻译

What do you MEME? Generating Explanations for Visual Semantic Role Labelling in Memes

Shivam Sharma , Siddhant Agarwal , Tharun Suresh , Preslav Nakov , Md. Shad Akhtar , Tanmoy Charkraborty

分类：自然语言处理

2022-12-01

Memes are powerful means for effective communication on social media. Their effortless amalgamation of viral visuals and compelling messages can have far-reaching implications with proper marketing. Previous research on memes has primarily focused on characterizing their affective spectrum and detecting whether the meme's message insinuates any intended harm, such as hate, offense, racism, etc. However, memes often use abstraction, which can be elusive. Here, we introduce a novel task - EXCLAIM, generating explanations for visual semantic role labeling in memes. To this end, we curate ExHVV, a novel dataset that offers natural language explanations of connotative roles for three types of entities - heroes, villains, and victims, encompassing 4,680 entities present in 3K memes. We also benchmark ExHVV with several strong unimodal and multimodal baselines. Moreover, we posit LUMEN, a novel multimodal, multi-task learning framework that endeavors to address EXCLAIM optimally by jointly learning to predict the correct semantic roles and correspondingly to generate suitable natural language explanations. LUMEN distinctly outperforms the best baseline across 18 standard natural language generation evaluation metrics. Our systematic evaluation and analyses demonstrate that characteristic multimodal cues required for adjudicating semantic roles are also helpful for generating suitable explanations.

translated by 谷歌翻译

Domain-aware Self-supervised Pre-training for Label-Efficient Meme Analysis

Shivam Sharma , Mohd Khizir Siddiqui , Md. Shad Akhtar , Tanmoy Chakraborty

分类：自然语言处理 | 人工智能

2022-09-29

现有的自我监督学习策略被限制在有限的目标或主要针对单峰应用程序的通用下游任务。对于复杂性和域亲和力（例如模因分析）而言，这对命令性的多模式应用有了孤立的进展。在这里，我们介绍了两种自我监督的预训练方法，即ext-pie-net和mm-simclr（i）在预训练期间使用现成的多模式仇恨语音数据，并且（ii）执行自我 - 通过合并多个专业借口任务，有效地迎合模因分析所需的复杂多模式表示学习，从而有效地迎合了学习。我们实验不同的自我实验策略，包括可以帮助学习丰富的跨模式表示并使用流行的线性探测来评估可恨模因任务的潜在变体。拟议的解决方案通过标签有效的培训与完全监督的基线竞争，同时在梅诺特挑战的所有三个任务上明显优于他们，分别为0.18％，23.64％和0.93％的绩效增长。此外，我们通过在Harmeme任务上报告竞争性能来证明所提出的解决方案的普遍性。最后，我们通过分析特定于任务的学习，使用更少的标记培训样本来建立学习表现的质量，并争辩说，自主策略和手头下游任务的复杂性是相关的。我们的努力强调了更好的多模式自学方法的要求，涉及有效的微调和可推广性能的专业借口任务。

translated by 谷歌翻译

Efficiently Learning Recoveries from Failures Under Partial Observability

Shivam Vats , Maxim Likhachev , Oliver Kroemer

分类：机器人 | 人工智能 | 机器学习

2022-09-27

在现实世界条件下运行的原因是由于部分可观察性引起的广泛故障而具有挑战性。在相对良性的环境中，可以通过重试或执行少量手工恢复策略之一来克服这种失败。相比之下，诸如打开门和组装家具之类的接触式连续操作任务不适合详尽的手工设计。为了解决这个问题，我们提出了一种以样本效率的方式来鲁棒化操作策略的一般方法。我们的方法通过在模拟中探索发现当前策略的故障模式，从而提高了鲁棒性，然后学习其他恢复技能来处理这些失败。为了确保有效的学习，我们提出了一种在线算法值上限限制（值UCL），该算法选择要优先级的故障模式以及要恢复到哪种状态，以使预期的性能在每个培训情节中最大程度地提高。我们使用我们的方法来学习开门的恢复技能，并在模拟和实际机器人中对其进行评估。与开环执行相比，我们的实验表明，即使是有限的恢复学习也可以从模拟中的71 \％提高到92.4 \％，从75 \％到90 \％的实际机器人。

translated by 谷歌翻译

Multiple Waypoint Navigation in Unknown Indoor Environments

Shivam Sood , Jaskaran Singh Sodhi , Parv Maheshwari , Karan Uppal , Debashish Chakravarty

分类：机器人

2022-09-18

室内运动计划的重点是解决通过混乱环境导航代理的问题。迄今为止，在该领域已经完成了很多工作，但是这些方法通常无法找到计算廉价的在线路径计划和路径最佳之间的最佳平衡。除此之外，这些作品通常证明是单一启动单目标世界的最佳性。为了应对这些挑战，我们为在未知室内环境中进行导航的多个路径路径计划者和控制器堆栈，在该环境中，路点将目标与机器人必须在达到目标之前必须穿越的中介点一起。我们的方法利用全球规划师（在任何瞬间找到下一个最佳航路点），本地规划师（计划通往特定航路点的路径）以及自适应模型预测性控制策略（用于强大的系统控制和更快的操作）。我们在一组随机生成的障碍图，中间航路点和起始目标对上评估了算法，结果表明计算成本显着降低，具有高度准确性和可靠的控制。

translated by 谷歌翻译

The SZ flux-mass ($Y$-$M$) relation at low halo masses: improvements with symbolic regression and strong constraints on baryonic feedback

Digvijay Wadekar , Leander Thiele , J. Colin Hill , Shivam Pandey , Francisco Villaescusa-Navarro , David N. Spergel , Miles Cranmer , Daisuke Nagai , Daniel Anglés-Alcázar , Shirley Ho

分类：人工智能 | 机器学习

2022-09-05

光环伴形培养基中的离子气体通过热阳光阳光层（TSZ）效应在宇宙微波背景上留下烙印。来自活性银河核（AGN）和超新星的反馈会影响晕孔集成TSZ通量的测量（$ y_ \ mathrm {sz} $），并导致其与光晕质量的关系（$ y_ \ mathrm {sz} -mm $ ）偏离病毒定理的自相似幂律预测。我们对使用骆驼，一套流体动力模拟的套件进行了全面研究，反馈处方的差异很大。我们使用两个机器学习工具（随机森林和符号回归）的组合来搜索$ y-m $关系的类似物，这对低质量的反馈过程（$ m \ sillesim 10^{14} \，h^， {-1} \，m_ \ odot $）;我们发现，仅替换$ y \ rightarrow y（1+m _*/m_ \ mathrm {gas}）$在关系中使其非常相似。这可以用作低质量簇和星系组的强大多波长质量代理。我们的方法通常对于提高其他天体分级关系的有效性领域通常也很有用。我们还预测，$ y-m $关系的测量值可以在反馈参数的某些组合和/或排除超级新闻和AGN反馈模型的主要部分，以提供百分比的约束。艺术流体动力模拟。我们的结果对于使用即将进行的SZ调查（例如SO，CMB-S4）和Galaxy Surveys（例如Desi和Rubin）来限制Baryonic反馈的性质。最后，我们发现，$ y-m _*$的另一种关系提供了有关反馈的补充信息，而不是$ y-m $。

translated by 谷歌翻译